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Abstraction  this  paper,  we-fcse  crossing  number  and  wire  area  argument^to  find 
lower  bounds  on  the  layout  area  and  maximum  edge  length  of  a  variety  of  new  and 
computationally  useful  networks.  In  particular,  we  describe 

^an  A-nodc  planar  graph  which  has  layout  area J^NloglV)  and  maximum  edge 
length  t  wsr* 

2)  an  A-node  graph  with  an  j3(x//,?)-separator  which  has  layout  area  0iN/o^bf) 
and  maximum  edge  length  0(N,/:!logN/loglogN)%  and 

3)  an  A-node  graph  with  an  p(x^* ^-separator  which  has  maximum  edge  length 
/0(A,/*//O  for  any  r  ,3. 
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1.  Introduction 

Recent  advances  in  Very  Large  Scale  Integration  (VLSI)  circuit  technology  have 
made  it  possible  to  wire  tens  of  thousands  of  transistors  onto  a  single  chip.  In  the 
near  fututre,  it  is  expected  that  fabrication  of  chips  containing  millions  of  transistors 
will  be  commonplace  [MC80].  In  order  that  this  massive  computational  resource  be 
effectively  utilized,  theoretical  researchers  have  been  actively  trying  to  develop 
models  and  methods  for  designing  VLSI  chips.  Most  of  these  efforts  have  been 
directed  towards  producing  network  layouts  which  minimize  the  amount  of  area 
consumed  by  the  chip.  This  is  due  to  the  fact  that  small  chips  are  usually  much 
cheaper  and  more  reliable  than  large  chips. 

Of  the  several  mathematical  models  that  have  been  proposed  for  VLSI 
computation,  the  simplest  and  most  widely  accepted  is  the  Thompson  grid  model 
[T79,  T80].  The  grid  model  of  a  VLSI  chip  is  quite  simple.  The  chip  is  presumed  to 
consist  of  a  grid  of  horizontal  and  vertical  tracks  which  are  spaced  apart  by  unit 
intervals.  The  nodes  of  the  network  are  viewed  as  points  and  are  located  only  at  the 
intersection  of  grid  tracks.  Wires  are  routed  through  the  tracks  in  order  to  connect 
pairs  of  nodes.  Although  a  wire  in  a  horizontal  track  is  allowed  to  cross  a  wire  in  a 
vertical  track  (without  making  an  electrical  connection),  pairs  of  wires  are  not 
allowed  to  overlap  for  any  distance  or  to  overlap  at  corners  (i.e.,  they  cannot  overlap 
in  the  same  track).  Further,  wires  are  not  allowed  to  overlap  nodes  to  which  they  are 
not  linked.  (The  routing  of  wires  in  this  fashion  is  also  known  as  layer  per  direction 
routing  and  Manhattan  routing .) 

The  area  of  a  layout  in  the  grid  model  is  defined  to  be  the  product  of  the  number 
of  horizontal  tracks  and  the  number  of  vertical  tracks  which  contain  a  node  or  wire 
segment  of  the  layout.  For  example,  the  layout  shown  in  Figure  1  has  area  15. 
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It  is  easy  to  construct  a  layout  for  the  graph  shown  in  Figure  1  which  has  area  9, 
implying  that  the  layout  in  Figure  1  is  not  optimal.  In  general,  however,  it  is  very 
difficult  to  tell  if  a  layout  is  optimal  or  even  close  to  optimal.  ITiis  is  due  to  the  fact 
that  relatively  little  is  known  about  proving  lower  bounds  on  the  area  needed  to  lay 
out  a  graph.  Past  research  in  this  area  has  centered  on  the  related  problem  of  finding 
good  lower  bounds  on  the  bisection  width  of  a  graph  [T79,  T80,  V80,  LS81]  (The 
biseciion  width  of  a  graph  is  the  minimum  number  of  edges  which  must  be  removed 
in  order  to  partition  the  graph  into  two  neariy-equal-sized  subgraphs.) 

The  relationship  between  bisection  width  and  layout  area  was  first  noticed  by 
Thompson  [T79]  who  showed  that  the  layout  area  of  a  graph  with  bisection  width  B 
is  at  least  Sl(B*).  Since  that  time,  Thompson  [T80],  Vuillemin  [V80]  and  Upton  and 
Sedgewick  [LS81]  have  all  shown  how  to  use  information  theoretic  arguments  in 
order  to  find  good  lower  bounds  on  the  bisection  width  of  a  graph  and  thus  on  its 
layout  area.  While  these  methods  are  useful  in  finding  good  lower  bounds  on  the 
layout  area  of  some  networks  (such  as  the  shuffle-exchange  graph),  they  have  not 
been  of  use  in  resolving  two  of  the  key  open  questions  in  VLSI  theory;  namely, 

1)  "How  much  area  is  needed  to  lay  out  a  planar  graph?,"  and 

2)  "How  much  area  is  required  to  lay  out  a  graph  with  an  0(  a  ^-separator?." 
(An  A-node  graph  is  said  to  have  an  J[x)- separator  if  it  can  be  partitioned  into 
two  nearly-equal-sized  subgraphs  Gy  and  G2  such  that  at  most  / N)  edges  link 
Gt  to  G2  and  both  Gj  and  G2  have  /^-separators.) 

The  planar  graph  question  is  particularly  important  since  (as  we  will  show  in 
Theorem  2)  the  layout  problem  for  an  arbitrary  graph  can  be  reduced  to  that  for  a 
planar  graph.  Although  no  nontrivial  lower  bounds  are  known  for  either  problem, 
progress  has  been  made  on  the  corresponding  upper  bounds.  In  particular,  Leiserson 
[L80a]  and  Valiant  [V81J  have  shown  how  to  lay  out  any  /V-node  graph  which  has  an 
0(x//?)-separator  in  at  most  0{Nlog?N)  area.  As  Lipton  and  Tarjan  (LT80J  have 
proved  an  0(x//2)-separator  theorem  for  the  class  of  planar  graphs,  their  0(Nlog2N) 
area  layout  technique  also  works  for  planar  graphs.  Although  it  is  suspected  that 
better  layout  techniques  exist  for  planar  graphs,  none  have  yet  been  found. 

In  this  paper,  we  pursue  an  entirely  different  strategy  in  developing  new  lower 
bound  techniques  for  VLSI.  Whereas  previous  researchers  have  been  concerned 
primarily  with  the  bisection  width  of  a  network,  we  shall  be  concerned  with  its 
crossing  number  and  wire  area  The  crossing  number  of  a  graph  is  the  minimum 


number  of  pairs  of  edges  which  must  cross  in  any  planar  drawing  of  the  graph.  The 
wire  area  of  a  graph  is  the  minimum  amount  of  wire  which  is  needed  to  lay  out  the 
graph  in  the  Thompson  grid  model  [T79,  T80].  Clearly,  the  crossing  number  and 
wire  area  are  lower  bounds  on  the  layout  area  of  any  graph.  In  fact,  we  will  show  in 
Theorem  1  that 

tyB2)  <  C+N  <  W  <  A 

for  any  /V-node  graph  with  bisection  width  B,  crossing  number  C,  wire  area  W  and 
layout  area  A. 

The  preceding  inequality  implies  that  every  lower  bound  technique  for  bisection 
width  can  be  translated  into  a  lower  bound  technique  for  crossing  number  and  wire 
area.  Thus  nothing  is  lost  by  forgetting  about  bisection  width  and  concentrating  ones 
efforts  on  finding  good  lower  bounds  for  the  crossing  number  and  wire  area  of  a 
graph.  In  fact,  much  can  be  gained.  For  example,  we  will  use  such  techniques  in 
this  paper  to  construct 

1)  an  ./V-node  planar  graph  which  has  layout  area  O(NlogN)  and 

2)  an  /V-node  (nonplanar)  graph  with  an  0(x//2)-separator  which  has  layout  area 
Q(Nlog2N). 

The  first  result  demonstrates  that  not  all  planar  graphs  can  be  laid  out  in  linear 
area,  thus  disproving  a  popular  conjecture.  The  second  result  indicates  that 
Leiserson  and  Valiant's  0(/V/og^/V)-area  layout  technique  for  graphs  with  0(x,/2y 
separators  is  optimal  at  least  some  of  the  time  and  thus  cannot,  in  general,  be 
improved. 

There  has  also  been  a  great  deal  of  interest  lately  in  the  problem  of  determining 
the  length  of  the  longest  wire  in  any  layout  of  a  network  on  a  chip  [BL81,  CM81, 
PRS81j.  Bhatt  and  Leiserson  [BL81],  in  particular,  have  found  some  nice  upper 
bounds  for  this  problem.  Very  little  has  been  accomplished  in  the  way  of  lower 
bounds,  however,  since  bisection  width  arguments  do  not  seem  to  be  applicable  to 
edge  length  considerations.  Crossing  number  and  wire  area  arguments,  on  the  other 
hand,  are  very  helpful  in  proving  good  lower  bounds  on  maximum  edge  length.  In 
this  paper,  we  will  use  such  arguments  to  find 

1)  an  /V-node  planar  graph  for  which  any  layout  must  have  a  wire  of  length 
Q(Nl/2/logl/2N), 
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2)  an  N- node  (nonplanar)  graph  with  an  CKx^-separator  for  which  any  layout 
must  have  a  wire  of  length  0( Nl/2logN/loglogN)  and 

3)  an  JV-node  graph  with  an  CK-x^O'Separator  for  which  any  layout  must  have 
a  wire  of  length  B(Nhl/r)  for  any  r>3. 

The  latter  two  results  achieve  the  known  upper  bounds  for  maximum  wire  length. 
They  also  indicate  that  some  wires  in  layouts  of  certain  graphs  must  be  very  long 
(possibly  as  long  as  the  length  of  the  entire  layout). 

For  easy  reference,  we  have  summarized  our  new  lower  bounds  along  with  the 
previously  known  upper  and  lower  bounds  in  the  Tables  1  and  2.  The  nontrivial 
upper  bounds  in  Table  1  are  due  to  Leiserson  [L80a]  and  Valiant  [V81]  while  those  in 
Table  2  are  due  to  Bhatt  and  Leiserson  [BL81].  The  previously  known  lower  bounds 
are,  for  the  most  part,  trivial.  The  only  exception  is  the  N2a  area  lower  bound  in 
Table  1  which  is  due  to  Thompson  [T80].  In  each  table,  the  upper  bounds  apply  to 
all  graphs  while  the  lower  bounds  pertain  only  to  a  special  class  of  graphs.  (For 
convenience,  we  have  left  out  the  0(  )  notation  on  the  upper  bounds  and  the  fl( ) 
notation  on  the  lower  bounds.) 


Table  1 


Area  Bounds 


separator 

previous 
lower  bound 

our 

lower  bound 

upper 

bound 

x«,  a  <  1/2 

N 

N 

x",  a  =  1/2 

N 

Nlog2N 

NIog2N 

x°,  a  >  1/2 

N2a 

N2a 

la! 


Table  2 


Maximum  Edge  Length  Bounds 


separator 

previous 
lower  bound 

OUT 

lower  bound 

upper 

bound 

x°,  a  <  1/2 

Nl/2/logN 

NI/2/logN 

xa,a  =  l/2 

NI/2/logN 

N1/2logN/loglogN 

N1/2logN/loglogN 

x°,a>  1/2 

Na/logN 

N« 

Na 

(planar) 

Nl/2/logN 

Nl/2/logl/2N 

N,/2logN/loglogN 

The  remainder  of  the  paper  is  organized  as  follows.  In  section  2,  we  describe  the 
networks  for  which  we  will  later  prove  lower  bounds.  As  these  networks  are  new 
and  interesting  in  their  own  right  (they  include  new  networks  for  fast  sorting  and 
matrix  multiplication),  we  devote  a  fair  amount  of  space  to  each.  In  section  3,  we 
describe  the  general  relationship  between  crossing  number  and  layout  area.  We  also 
prove  crossing  number  and  maximum  edge  crossing  lower  bounds  for  the  nonplanar 
networks  described  in  section  2.  In  section  4,  we  generalize  the  methods  developed 
in  section  3  and  prove  wire  area  and  maximum  edge  length  lower  bounds  for  a 
variety  of  networks.  We  conclude  in  section  5  with  several  remarks,  related  results 
and  open  questions. 

Throughout,  we  limit  our  discussion  of  graphs  to  those  with  bounded  node 
degree  and  of  layouts  to  those  in  the  Thompson  grid  model.  Neither  constraint  is 
crucial  to  our  results,  however.  For  example,  all  of  the  lower  bound  proofs  work 
equally  well  for  the  Lipton-Sedgewick  [LS81J  (or  any  similar)  model  of  chip  design. 


2.  Network  Constructions 

(2a)  The  2-dimensional  Mesh  of  Trees 

The  2-dimensional  nxn  mesh  of  trees  M2  n  is  defined  when  n  is  a  power  of  2  as 
follows.  Starting  with  an  nxn  matrix  of  nodes  and  adding  nodes  wherever  necessary, 


construct  a  complete  binary  tree  in  each  row  and  column  of  the  matrix.  The  trees 
should  be  constructed  so  that 

1)  the  leaves  in  each  tree  are  precisely  the  nodes  in  the  corresponding  row  or 
column  of  the  original  matrix,  and 

2)  the  subgraph  induced  on  the  nodes  in  each  quadrant  is  M2n/2  • 

For  example,  we  have  drawn  M24  in  Figure  2.  The  nodes  in  the  original  4x4 
matrix  are  represented  by  dots.  The  nodes  which  were  added  in  order  to  form  row 
trees  are  drawn  as  small  triangles  while  those  added  to  form  column  trees  are  shown 
as  small  squares.  Solid  lines  indicate  row  tree  edges  while  dashed  lines  indicate 
column  tree  edges. 


A. 


Figure  2:  The  4x4  mesh  of  trees  M^4 


Notice  that  if  we  were  to  remove  the  roots  of  the  row  and  column  trees  of  M2 4 
and  the  edges  incident  to  them,  we  would  be  left  with  4  copies  of  M2  2 ,  one  in  each 
quadrant  In  general,  if  we  were  to  remove  the  nodes  and  edges  in  the  top  k  levels 


6 


of  the  binary  trees  in  M2ll  ,  we  would  be  left  with  22k  copies  of  M2n2'k  •  Th‘s 
important  property  of  meshes  of  trees  is  used  extensively  in  the  proofs  of  Theorems 
3,  4,  5  and  8. 

Computationally,  the  nxn  mesh  of  trees  is  a  very  powerful  network.  Among 
other  things,  it  can  be  used  to 

1)  multiply  a  fixed  nxn  matrix  by  m  different  ^-vectors  in  m+2logn  (word) 
steps, 

2)  sort  a  list  of  n  m-bit  words  in  2m+5logn  (bit)  steps,  and 

3)  link  n  input  terminals  to  n  output  terminals  in  any  order  in  /ogn  (bit)  steps. 

The  processors  and  algorithms  needed  for  these  operations  are  extremely  simple. 
For  example,  in  order  to  sort  a  list  of  n  m-bit  words,  each  node  need  only  contain  a 
few  and  and  or  gates.  The  algorithm  for  this  operation  proceeds  as  follows.  Starting 
at  the  roots,  the  ilh  word  to  be  sorted  is  input  (bit  by  bit)  into  the  ilh  row  and 
column  trees  for  each  /,  l<i<n.  The  bits  are  passed  down  each  tree  so  that  after 
logn  steps  the  leading  bit  of  the  ilh  word  has  reached  each  leaf  of  the  ilh  row  and 
column  trees.  Comparison  of  the  ilh  and  jth  words  for  all  /  and  j  can  then  proceed 
simultaneously.  After  at  most  m  additional  steps,  the  (ij)  leaf  has  decided  whether 
the  ilh  word  is  smaller  or  larger  than  the  jth  word.  Tics  are  broken  arbitrarily  (e.g., 
depending  on  the  values  of  /  and  j).  Once  this  is  done,  each  leaf  transmits  a  0  or  a  / 
to  its  column  tree  father  depending  on  whether  its  column  tree  word  was  smaller  or 
larger  than  its  row  tree  word.  Each  column  tree  then  sums  these  values  in  order  to 
determine  the  position  of  its  word  in  the  final  ordering.  (If  the  sum  is  carried  out  bit 
by  bit  starting  with  the  least  significant  bit,  this  process  takes  2logn  steps.)  This 
information  is  then  used  to  mark  a  path  in  each  column  tree  from  the  root  to  that 
leaf  which  is  also  in  the  appropriate  row  tree  (again  taking  2logn  steps).  Once  this  is 
done,  it  is  a  simple  matter  to  transmit  the  bits  of  the  ith  word  along  the  unique  path 
from  the  ilh  column  tree  root  to  the  appropriate  row  root  for  each  /.  As  the  paths 
are  all  pairwise  disjoint,  this  process  takes  only  m+2logn  steps. 

The  algorithm  just  described  sorts  a  list  of  n  m- bit  numbers  in  2m-h7logn  steps. 
It  is  a  simple  exercise  to  speed  up  the  algorithm  to  obtain  the  2m+5logn  step  bound. 
We  should  also  point  out  that  this  algorithm  is  similar  to  the  one  described  by 
Muller  and  Preparata  in  [MP75].  The  VLSI  implementation  of  the  algorithm  is  new, 
however,  and  far  superior  to  many  of  the  VLSI  sorting  algorithms  discussed  by 
Thompson  in  his  recent  survey  paper  [T81]. 
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It  is  easy  to  show  that  the  nxn  mesh  of  trees  has  N- 3n2- 2  n  nodes,  bisection 
width  n=9(Nl/2)t  and  an  0(x//2)-separator.  In  Theorems  3,  5  and  8  (respectively) 
we  will  show  that  any  layout  for  the  JV-node  2-dimensional  mesh  of  trees  has 

1)  at  least  Q(NlogN)  crossings, 

2)  at  least  Q(Nlog2N)  wire  area,  and 

3)  some  edge  of  length  at  least  Nl/2logN/loglogN). 

(It  is  worth  noting  that  all  of  these  bounds  are  tight.) 

(2b)  The  rdimensional  Mesh  of  Trees 

The  2-dimensional  mesh  of  trees  can  be  easily  generalized  to  higher  dimensions. 
For  example,  the  3-dimensional  nxnxn  mesh  of  trees  A/?  n  can  be  constructed  as 
follows.  Starting  with  an  nxnxn  cube  of  nodes  and  adding  nodes  wherever  necessary, 
construct  a  set  of  n2  complete  binary  trees  in  each  of  the  three  dimensions  of  the 
cube.  As  before,  the  trees  should  be  constructed  so  that  the  leaves  are  precisely  the 
nodes  of  the  original  cube  and  so  that  the  subgraph  induced  on  each  octant  of  nodes 

is  M3.n/2  • 

The  nxnxn  mesh  of  trees  is  also  a  very  useful  network  for  parallel  computation. 
For  example,  it  can  be  used  to  compute  the  products  of  m  pairs  of  nxn  matrices  in 
m  +  2logn  (word)  steps.  The  algorithm  and  processors  needed  for  this  operation  are 
quite  simple.  The  algorithm  proceeds  as  follows.  At  each  time  step,  a  pair  of 
matrices  is  entered  into  the  network  via  the  roots  of  the  trees  in  two  of  the 
dimensions  (one  dimension  for  each  matrix).  The  entries  are  passed  down  through 
the  trees  so  that  after  logn  steps,  the  leaf  in  the  (r,s.t)  position  of  the  cube  contains 
the  (/;$)  entry  of  the  first  matrix  and  the  ( s,t )  entry  of  the  second  matrix  for  each  r,s 
and  t.  All  n-?  multiplications  can  then  be  performed  simultaneously.  The  entries  of 
the  product  matrix  are  then  calculated  by  summing  the  values  of  the  leaves  of  each 
tree  in  the  third  (previously  unused)  dimension.  This  process  takes  an  additional 
logn  steps.  As  the  network  is  easily  pipelined,  it  is  clear  that  die  total  computation 
time  is  just  m+2logn  (word)  steps. 

A  simple  counting  argument  reveals  that  Mi  n  has  N=4n2-2n2  nodes  for  each  n 
and  that  the  class  of  such  graphs  has  an  0(.\ ^-separator  theorem.  Thus  the  N- 
node  2-dimensional  mesh  of  trees  can  be  laid  out  in  0(N4/*)=O(n4)  area.  As  the 
the  bisection  width  of  the  network  has  size  9{N2/i\  we  can  conclude  that  the  layout 
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area  is  precisely  Q(N4/3).  At  the  same  time,  we  can  also  observe  that  the  network 
nearly  achieves  the  optimal  AT2  bound  for  matrix  multiplication  [T79].  Although 
Preparata  and  Vuillemin  [PV80]  have  already  found  an  optimal  network  for  fast 
matrix  multiplication,  it  appears  that  M3ll  is  much  simpler  and  far  easier  to  program. 

Our  primary  interest  in  the  higher  dimensional  meshes  of  trees  is  not  in  their 
computational  power,  however.  We  are  interested  more  in  the  fact  that  all  layouts 
for  such  graphs  must  have  very  long  edges.  In  fact,  we  will  show  in  Theorem  4  that 
any  layout  for  the  N-node  r-dimensiona!  mesh  of  trees  must  have  an  edge  of  length 
Q(N,~,/r)  for  any  N  and  r>3.  ft  is  not  difficult  to  show  that  the  ^dimensional  mesh 
of  trees  has  an  O(x/  //r)-separator  and  thus  that  such  edges  are  (up  to  a  constant)  as 
long  as  the  side  length  of  any  optimal  layout. 

(2c)  The  Tree  of  Meshes 

The  tree  of  meshes  is  similar  to  the  2-dimensional  mesh  of  trees  in  that  it 
combines  the  structure  of  a  mesh  with  that  of  a  complete  binary  tree  in  a  natural 
way.  Unlike  the  2-dimensional  mesh  of  trees,  however,  the  tree  of  meshes  is  a  planar 
graph.  It  is  formed  by  replacing  each  node  of  a  complete  binary  tree  with  a  mesh 
and  each  edge  by  several  edges  which  link  the  meshes  together.  More  precisely,  the 
root  of  the  binary  tree  is  replaced  by  an  nxn  mesh  (where  n  is  assumed  to  be  a  power 
of  2),  its  sons  are  replaced  by  n/2  x  n  meshes,  their  sons  are  replaced  by  n/2  x  n/2 
meshes,  and  so  on  until  the  leaves  are  replaced  by  lxl  meshes.  In  the  place  of  each 
right  edge  of  the  binary  tree  (i.e.,  one  which  links  a  node  to  its  right  son),  we  link  the 
rightmost  column  of  nodes  in  the  mesh  corresponding  to  the  father  to  the  topmost 
row  of  nodes  in  the  mesh  corresponding  to  the  right  son.  Similar  replacements  are 
made  for  left  edges  of  the  binary  tree.  In  both  cases,  the  connections  are  made  so  as 
to  preserve  the  column  and  row  order  of  the  nodes  and  to  insure  that  the  resulting 
graph  is  planar.  A  simple  counting  argument  reveals  that  the  resulting  graph  (which 
we  call  the  nxn  tree  of  meshes  Tn)  has  N=2n2logn+n 2  nodes.  For  example,  we 
have  drawn  T4  in  Figure  3. 

The  tree  of  meshes  is  a  particularly  interesting  planar  graph  since  it  can  embed 
arbitrary  planar  graphs  much  more  efficiently  than  can  the  ordinary  mesh.  For 
example,  it  is  not  known  how  to  embed  an  arbitrary  planar  graph  in  less  than  an 
QiNlog2!^ /)-node  mesh.  As  we  show  in  our  thesis  [L81a],  however,  any  A’-node 
planar  graph  can  be  embedded  in  an  0(NlogN)-node  tree  of  meshes.  Thus  the  tree 


of  meshes  is  an  excellent  candidate  for  a  planar  graph  which  cannot  be  laid  out  in 
linear  area.  In  fact,  we  will  show  in  Theorem  6  that  the  A-node  tree  of  meshes  has 
layout  area  Q(NlogN). 

The  tree  of  meshes  can  also  be  used  to  embed  many  nonplanar  graphs  which 
have  0(x//2)-separators.  Of  particular  interest  here  is  the  fact  that  M2n  can  be 
embedded  in  hn  for  any  n.  In  order  to  construct  such  an  embedding,  we  first 
embed  (recursively)  four  copies  of  M2n/2  *n  f°l,r  copies  of  Tn  (one  in  each).  Next, 
we  embed  the  roots  of  M2n  in  the  2n  x  2n  mesh  of  'Ihe  embedding  of  M2n 
is  completed  by  using  the  n  x  2n  meshes  of  T2n  as  switching  networks  to  link  the 
roots  of  M2  n  to  its  four  subgraphs  M2  fl/2  (each  of  which  is  already  embedded  in  a 
copy  of  7y. 

As  an  example  of  the  procedure,  we  have  included  the  embedding  of  M24  in  T8 
in  Figure  4.  The  embedding  has  been  drawn  as  though  it  were  constructed  as  part  of 
a  larger  embedding  (say,  of  M2S)  in  order  to  illustrate  the  recursive  nature  of  the 
embedding  algorithm.  In  addition,  we  have  drawn  the  nodes  and  edges  of  M 24  as 
they  appear  in  Figure  2.  For  clarity,  we  have  represented  the  nodes  of  T8  as 
pinpoints  and  omitted  its  edges  altogether.  (For  a  more  complete  description  of  the 
embedding  algorithm,  we  again  refer  the  reader  to  [L81a].) 
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embedding  of  H 


(bottom  two  levels  mused) 


Figure  4:  The  embedding  of  M24  in  Ts . 


As  we  will  show  in  Theorem  5,  any  layout  for  A/>„  requires  ^l(n2log^n)  area. 
Thus  it  is  easy  to  see  that  any  layout  for  hn  also  requires  Qin^log^n)  area. 
Equivalently,  any  layout  for  the  A- node  tree  of  meshes  requires  at  least  Q(NlogN) 
area.  (It  is  worth  noting  that  an  0(/V/o£jV)-area  layout  for  the  Af-node  tree  of  meshes 
can,  in  fact,  be  constructed  by  expanding  the  standard  H*tree  layout  for  binary  trees.) 


(2d)  The  Augmented  Tree  of  Meshes 


In  [L81a],  we  show  that  the  maximum  edge  length  of  the  N- node  tree  of  meshes 
is  Q(logN).  By  slightly  modifying  the  graph,  however,  it  is  possible  to  increase  the 
maximum  edge  length  dramatically.  The  basic  idea  is  to  add  a  complete  binary  tree 
with  n2  leaves  to  the  nxn  tree  of  meshes  so  that  the  leaves  of  one  are  linked  in  a  one- 
to-one  fashion  to  the  leaves  of  the  other.  It  is  important  that  the  attachments 
between  the  two  graphs  be  made  so  that  the  resulting  graph  (which  we  call  the  nxn 
augmented  tree  of  meshes  Tn')  is  planar.  For  example,  we  have  drawn  the  4x4 
augmented  tree  of  meshes  in  Figure  5. 


tree  of  mtahta 


Figure  5:  The  4x4  augmented  tree  of  meshes  T4 ' . 


It  is  easily  seen  that  the  augmented  tree  of  meshes  has,  up  to  a  constant,  the  same 
bisection  width,  separator,  layout  area  and  number  of  nodes  as  does  the  original  tree 
of  meshes.  By  adding  the  binary  tree,  we  have  simply  decreased  the  distance  (i.e., 
the  length  of  the  shortest  path)  between  any  two  leaves  of  the  tree  of  meshes.  In 
section  4,  we  will  show  that  any  layout  of  the  /V-node  tree  of  meshes  must  have  two 
leaves  which  are  spaced  at  least  Q(Nl/2logl/2N)  apart  We  will  thus  be  able  to 
conclude  that  the  maximum  edge  length  of  the  /V  node  augmented  tree  of  meshes  is 
at  least  Q(Nl/2/logl/2N).  Using  arguments  similar  to  those  found  in  [BL81],  it  can 
be  shown  that  this  bound  is  tight 

3.  Crossing  Number  Arguments 

(3a)  General  Results 

We  first  demonstrate  the  power  of  the  crossing  number  as  a  general  lower  bound 
technique  for  layout  area. 

Theorem  1:  If  G  is  an  N'tiode  graph  with  crossing  number  C  and  bisection  width 
B.  then  C+N  >  tyB2). 

Proof:  Let  D  be  a  drawing  of  G  in  the  plane  with  C  crossings.  Replace  each 
crossing  of  D  with  an  artificial  node.  Call  the  resulting  graph  G'  and  note  that  it  has 
precisely  C+N  nodes.  Using  the  weighted  version  of  the  Lipton-Tarjan  planar 
separator  theorem  [LT80],  it  is  possible  to  bisect  the  real  nodes  of  G'  (by  assigning 
weight  1  to  the  real  nodes  and  weight  0  to  the  artificial  nodes)  without  cutting  more 
than  0((C+N),/2)  edges.  After  replacing  the  artificial  nodes  with  their  original  edge 
crossings,  it  becomes  apparent  that  we  have,  in  fact,  constructed  an  0((C+N)l/2) 
bisection  for  G.  Squaring,  we  find  that  C+N  £  QiB2)  □ 

Using  a  similar  proof  technique,  we  can  show  that  the  crossing  number  is  also 
close  to  an  upper  bound  for  the  layout  area  of  a  graph.  In  fact,  should  a  really  good 
layout  algorithm  for  planar  graphs  be  found,  then  the  following  result  could  become 
useful  in  laying  out  arbitrary  graphs. 

Theorem  2:  Given  an  optimal  drawing  D  for  an  N-node  graph  G  with  crossing 
number  C,  it  is  possible  to  construct  a  layout  for  G  with  area  at  most 
0((C + N)log\C + N)).  Should  a  procedure  be  found  which  lays  out  an  arbitrary 
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N-node  planar  graph  in  A(N)  area,  then  we  could  construct  a  layout  for  G  with  area  at 
most  0(A(C + N)). 

Proof:  As  in  the  proof  of  Theorem  1,  we  replace  each  edge  crossing  of  D  with  an 
artificial  node.  The  resulting  graph  G'  has  C+N  nodes  and  is  planar.  Using  the 
methods  developed  by  Lipton  and  Tarjan  [LT80J  and  Leiserson  [L80a],  G'  can  be 
laid  out  in  0((C + N)log*{C + A))  area.  It  is  then  a  simple  matter  to  replace  the 
artificial  nodes  with  their  original  edge  crossings  to  obtain  die  desired  layout  for  G. 
Alternatively,  should  an  A(JV)-area  planar  graph  layout  procedure  be  discovered,  we 
could  construct  an  0{A(C+ A))-area  layout  for  G  □ 

As  we  have  just  seen,  the  idea  of  replacing  edge  crossings  with  artificial  nodes  is 
simple  but  powerful.  Jai-Wei  and  Rosenberg  have  also  employed  this  strategy  in 
their  work  with  embeddings  of  graphs  in  binary  trees  [JR81J. 

(3b)  Specific  Lower  Bounds 

From  Theorem  1,  we  know  that  crossing  number  arguments  can  give  good  lower 
bounds  on  the  layout  area  of  many  graphs.  For  example,  we  could  simply 
reformulate  the  techniques  developed  by  Thompson  [T79,  T801  and  others  [V80, 
LS81]  in  terms  of  crossing  numbers.  Of  much  greater  interest,  however,  is  the  fact 
that  there  are  also  several  purely  combinatorial  techniques  for  proving  crossing 
number  lower  bounds.  (For  example,  see  Kleitman’s  work  [K70]  and  our  thesis 
(L81aJ.)  In  what  follows,  we  will  develop  a  new  procedure  for  proving  crossing 
number  lower  bounds  in  order  to  show  that  the  crossing  number  of  the  A-node  2- 
dimensional  mesh  of  trees  is  at  least  Sl(NlogN).  We  will  also  show  that  any  drawing 
of  the  A  node  r dimensional  mesh  of  trees  contains  an  edge  which  crosses  at  least 
fl(A/  //>)  other  edges. 

Our  results  require  the  following  standard  result  For  completeness  we  have 
included  one  of  the  many  proofs  from  [K70J. 

Lemma  1:  The  crossing  number  of  ,  the  complete  graph  on  A  nodes,  is  at  least 
A(A-/KA-2KA-i)//2d  for  N>5. 

Proof:  Let  D  be  a  drawing  of  KN  in  the  plane  with  the  smallest  possible  number 
of  crossings  C(A).  We  may  assume  that  no  pair  of  edges  which  cross  in  D  are 
incident  to  a  common  node.  Otherwise,  it  would  be  possible  to  produce  a  drawing 


D'  for  Kn  with  C(N)-l  crossings  by  exchanging  the  parts  of  the  crossing  edges 
which  lie  between  the  common  node  and  the  point  of  crossing.  This  would 
contradict  the  minimality  of  C(N). 

Consider  the  N  subdrawings  of  0  obtained  by  deleting  one  of  the  nodes  and  all 
of  the  edges  incident  to  it.  Note  that  each  crossing  of  D  appears  in  precisely  N-4  of 
the  subdrawings.  (A  crossing  does  not  appear  in  any  of  the  4  subdrawings  which 
correspond  to  the  deletion  of  a  node  incident  to  an  edge  of  the  crossing.)  Since  each 
of  the  subdrawings  is  a  drawing  of  KN.},  each  must  have  at  least  C(N-I)  crossings. 
Thus  ( N-4)C(N)  >  NC(N-l)  .  Applying  the  inequality  recursively  and  noting  that 
C(5)  =  /,  we  can  conclude  that 

C( AO  >  [N/(N-4)\  [(/V-/)/(^-5)l  •  •  •  [6/2] 

=  N(N-l)(N-2)(N-3yi20  for  N>5  □ 

Theorem  3:  The  crossing  number  of  the  N-node  2-dimensional  mesh  of  trees  is  at 
least  Sl{NlogN). 

Proof:  As  before,  let  M 2n  denote  the  2-dimensional  mesh  of  Uees  (where  n  is  a 
power  of  2).  We  will  show  that  the  crossing  number  of  M 2n  is  at  least 

( n2logn  -  121n2+  12!ri)/40  for  all  n>l. 

Since  M2n  has  N=Q(n2)  nodes,  this  will  be  sufficient  to  prove  the  desired  result 

The  proof  consists  of  two  steps.  In  the  first,  we  show  how  to  construct  a  drawing 
of  Kn2  from  any  drawing  of  M2  n  by  tracing  over  the  edges  of  M2  n  .  We  then  apply 
Lemma  1  to  conclude  that  there  are  a  large  number  of  crossings  among  the  edges  in 
the  top  levels  of  the  binary  trees  of  M2  n  .  In  the  second  step,  we  complete  the  proof 
by  inductively  applying  the  result  of  the  first  step. 

step  I:  Let  D  be  any  drawing  of  M2n  in  the  plane.  From  this  drawing,  we  can 
construct  a  drawing  O'  of  Kni  in  the  following  way.  First  locate  the  n2  leaves  of  the 
binary  trees  of  D.  They  will  serve  as  the  nodes  for  K, p  .  Given  any  pair  (ij)  and 
(k,I)  of  these  nodes,  draw  an  edge  from  ( i,j )  to  (k,l)  along  the  unique  path  from  ( ij ) 
to  (/,/)  in  the  ith  row  tree  of  D  and  then  from  ( i,l )  to  ( k,f)  in  the  lih  column  tree  of  D. 
(In  order  that  each  edge  not  be  drawn  twice,  we  shall  assume  that  Kk  and,  when 
/=  k ,  that  //.)  As  edges  in  D  will  be  traced  over  several  times  by  this  procedure,  it  is 
important  to  draw  the  edges  of  D '  so  that  no  pair  cross  each  other  more  than  once. 
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We  next  count  the  number  of  crossings  in  Z)'.  There  are  two  kinds  of  crossings 
to  consider.  The  first  kind  results  from  a  crossing  in  IX  More  precisely,  if  et  and  e2 
are  edges  of  M2  „  which  cross  in  D  and  c{  is  traced  over  Sj  times  while  e2  is  traced 
over  s2  times,  then  the  crossing  of  et  and  e2  will  appear  sfs2  times  in  D' .  This 
phenomenon  is  illustrated  in  Figure  6. 


•  12 
duplicated 
vN'xX  crossings 


T  original 

crossing  "^J8=i 


Figure  6:  Crossings  of  the  first  kind. 


The  second  kind  ot  crossing  results  from  edges  of  Kn2  which  must  cross  while 
traversing  a  common  edge  of  Z).  For  example,  see  Figure  7.  In  what  follows,  we  will 
show  that  the  number  of  crossings  of  the  second  kind  is  relatively  small. 


Figure  7:  Crossings  of  the  second  kind 


We  say  that  an  edge  is  type  i  if  it  is  in  the  ith  level  of  a  binary  tree  of  M2  n  .  It  is 
not  difficult  to  show  that  each  type  /  edge  is  traced  over  at  most 

nXHnl-n2?)  <  n3Xi 
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times  for  any  i<  logit  during  the  construction  of  O' .  Thus  at  most  crosses  of 

the  second  kind  can  occur  at  any  type  /  edge  of  D.  Since  there  are  2l+ln  type  / 
edges  in  M2n  ,  we  can  conclude  that  the  total  number  of  crosses  of  the  second  kind 
in  /)'  is  at  most 

X(2i+,nXn6?2H)  =  n7%P 

i-t 

*  «7- 

We  next  count  the  number  of  crossings  of  the  first  kind  (i.e.,  those  corresponding 
to  crosses  in  D).  We  say  that  a  crossing  of  D  is  type  i-j  if  it  is  the  crossing  of  a  type  / 
edge  and  a  type  j  edge.  Let  ti2  denote  the  number  of  type  i-j  crossings  in  D  and  set 

Mill 

h  ~  • 

Since  each  type  /  edge  is  traced  over  at  most  n3?'  times,  each  type  i-j  crossing  of  D 
produces  at  most 

(n3r%n3Tf)  =  n6r-i 

crosses  of  the  first  kind  in  /)' .  Thus  the  total  number  of  crossings  of  the  first  kind  in 
D'  is  at  most 


<*/  j** 


i*i  1  i't 


Summing,  we  find  that  the  total  number  of  crossings  of  either  kind  in  D '  is  at 


n7  +  n6Xl2iti 


By  Lemma  1,  this  number  must  be  at  least 


n2(n2-l)(n2-2)(n2-3yi 20  for  n2  >  5. 

Simplifying,  we  can  conclude  that 

'% r2it,  >  (n2-121riyi20  for  n>6. 

FT 

Let  sk-Ziti  be  die  number  of  crossings  involving  at  least  one  edge  from  the 
top  k  levels  of  some  binary  tree  of  M2  n  .  In  what  follows,  we  will  use  the  preceding 
inequality  to  show  that  sk  >  ( n2-121ri)k/40  for  at  least  one  value  of  k>1.  Assume 
otherwise  and  observe  that 
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%r\  =  2r2l(s,sH) 

1*1  1  i'W 

where  s0  is  defined  to  be  0.  The  coefficient  of  each  si  in  this  sum  is  positive  so  for 
each  i  we  may  substitute  ( n2-I2Jn)i/40  as  an  upper  bound  for  s(  in  order  to  see  that 

Hr*!,  <  I(n!-I2lny40)  ^ 

£%l  FT 

=  [(n2-l21n)/40]  2V' . 

A# 

tm in 

Since  2^  ^  7/J  for  all  n,  we  can  conclude  that 
*-# 

<  ( n2-121n)/l20 

for  all  «>/27,  a  contradiction.  Thus  for  all  n>!21,  there  is  a  k>l  such  that 

skZ(n2’l21n)k/40. 

step  2:  Let  C(ri)  denote  the  crossing  number  of  M2n  .  Using  the  result  of  step  1, 
we  will  now  show  by  induction  on  n  that  C(n)  >  ( n2logn  -  12ln2  +  I2Jn)/40  for  all 
n>l. 

As  ( n2logn  -  l2ln2+I2ln)/40  is  nonpositive  for  small  n.  the  lower  bound 
trivially  holds  for  all  /K72&  Assume  that  the  lower  bound  holds  for  all  nKn  where 
ri>128  and  let  D  be  any  drawing  for  M2n .  By  counting  the  crossings  of  D  in  two 
groups  according  to  whether  or  not  at  least  one  edge  of  the  crossing  is  contained  in 
the  top  k  levels  of  the  binary  trees  of  M2n  ,  we  can  observe  that 

C{n)  >  22kC(n2rk)  +  sk. 

(Recall  the  definition  of  sk  and  the  structure  of  M2  n  .)  By  choosing  k  as  in  step  1  so 
that  sk  >  (n2-121n)k/40  and  applying  the  inductive  hypothesis  for  C(n2'*),  we 
obtain 

a n)  >  22k[n2T2k{logn-ky40‘  121n2T2k/40  +  121nTk/40\  +  n2k/40  -  121nk/40 
2  n2logn/40  -  121n2/40  +  121n/40  +  121n(2k-k-iy40 
>  ( n2logn  -  J2Jn2  +  I21ny40  . 

Thus  the  inductive  hypothesis  is  established  and  we  can  conclude  that  the 
crossing  number  of  M2  n  is  at  least  Sl(n2logn)=Sl{NlogN)  □ 
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Theorem  4:  Any  drawing  of  the  N-node  r-dimensional  mesh  of  trees  contains  an 
edge  which  crosses  at  least  Q(Nl  ,/r)  other  edges. 

Proof:  The  r-dimensional  nxnx ■  >  >xn  mesh  of  trees  A frn  has  N  = 
(r+l)nr  ■  rn r'!  =  0(/ir)  nodes  for  bounded  r.  We  will  show  that  any  drawing  D  of 
Mrn  contains  an  edge  which  crosses  at  least  Q(nr'l)=U(Nl'l/l)  other  edges,  thus 
proving  the  theorem.  The  method  used  is  very  similar  to  that  of  Theorem  3. 

As  we  did  for  the  case  of  r-2  in  Theorem  3,  we  first  construct  a  drawing  D' of 
the  complete  graph  on  the  nr  leaves  of  Mrn  .  Each  type  /  edge  of  D  is  traced  over  at 
most  nr+1?'  times  by  this  procedure.  Thus  the  total  number  of  crossings  in  D'  is  at 
most 

( m3r+l)/2  +  n2r+2%T2iti 

Czf 

where,  as  before,  and  tu  is  the  number  of  type  if  crossings  in  D.  Applying 

Lemma  1,  we  can  conclude  that 

>^T2iti  >  Qin2^2) . 

£*'l. 

k 

Let  sk=  be  the  total  number  of  crossings  of  D  involving  an  edge  from  the  top 

iti 

k  levels  of  the  binary  trees  in  Mr  n  .  Using  arguments  similar  to  those  used  to  prove 
Theorem  3,  it  is  not  difficult  to  show  that  for  large  n,  there  exists  a  k  such  that 
Sf.  >  Q(n2r22k) .  As  there  are  only  rn^\2^+ 1-2)  edges  in  the  top  k  levels  of  Mrn 
for  any  k,  we  can  conclude  that  at  least  one  of  them  crosses  at  least  Q(/ir/)  other 
edges  □ 


4.  Wire  Area  Arguments 

As  we  have  just  seen,  crossing  number  arguments  can  be  very  powerful  in 
establishing  lower  bounds  on  layout  area  and  maximum  edge  length  for  VLSI 
networks.  Such  arguments  are  also  limited,  however,  to  the  kinds  of  results  obtained 
in  the  previous  section.  For  example,  in  our  thesis  [l >8  la],  we  show  that  the  crossing 
number  of  any  N-node  graph  with  an  Ofr^-separator  is  at  most  0(NlogN).  Thus, 
we  could  not  hope  to  improve  the  result  of  Theorem  3.  Nor  can  crossing  number 
arguments  be  used  to  prove  any  nontrivial  lower  bounds  for  planar  graphs. 

The  technique  of  using  a  drawing  of  a  network  to  construct  a  drawing  for  the 
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complete  graph  can  be  extended,  however.  In  what  follows,  we  use  this  technique  to 
find  better  lower  bounds  on  the  wire  area  of  certain  networks  than  were  possible 
^  with  crossing  number  arguments  alone.  More  precisely,  we  will  show  that  the  wire 

area  of  the  /V-node  2-dimensional  mesh  of  trees  is  at  least  Q,(Nlog2N)  and,  as  a 
corollary,  that  the  iV-node  (planar)  tree  of  meshes  has  wire  area  at  least  tt(NlogN). 
In  addition,  we  will  show  that  the  maximum  edge  length  of  the  A'-node  2-dimen- 
|  sional  mesh  of  trees  is  at  least  Q(Nl/2logN/loglogN)  while  that  of  the  N- node 

augmented  tree  of  meshes  is  at  least  £l(Nl/2/log1/2N). 

Theorem  5:  The  wire  area  of  the  N-nodc  2-dimensional  mesh  of  trees  is  at  least 
miog2N). 

Proof:  As  usual,  we  denote  the  nxn  mesh  of  trees  by  M2n  .  In  addition,  let  \V(n) 
denote  the  wire  area  of  M2n  and  let  a  be  a  positive  constant  such  that 

,  (*)  a  <  n/(4log2n)  for  all  n>2,  and 

oO 

(**)  a  <  22l'20Afi2i6)  for  all  i>l  where  )3  =  2  j  ~2  is  also  a  constant. 

Clearly  such  a  constant  exists  (a=T30  should  suffice)  and  clearly  W{n)  >  an2log2n 
|  for  n- 1  and  2.  Consider  a  value  of  n>4  which  is  a  power  of  2  and  assume  that  for 

all  values  of  nKn  which  are  powers  2  that  W(m)  >  am2log2rn  .  We  will  use 
induction  to  show  that  W(n)  >  an2log2n  .  Since  M 2n  has  N=0(n2)  nodes,  this  will 
be  sufficient  to  prove  the  theorem. 

S  Consider  any  layout  for  M2n  which  uses  W(n)  wire.  Partition  the  layout  into 

three  vertical  strips  V0 ,  Vf  and  V2  so  that  the  center  strip  contains  3n2/4  leaves  and 
each  outer  strip  contains  n2/8  leaves.  Similarly  partition  the  layout  into  three 
horizontal  strips  H0  ,  lil  and  H2  so  that  the  middle  strip  contains  3n2/4  leaves 
and  each  outer  strip  contains  n2/8  leaves.  For  example,  see  Figure  8. 

Let  d  denote  the  length  of  the  longest  side  of  the  center  block  formed  by  the 
intersection  of  Vj  and  Hl .  Without  loss  of  generality,  we  assume  that  the  longest 
<  side  is  horizontal.  In  what  follows,  we  will  show  that  d  >  (a ,/2nlogn)/8  . 

Since  each  of  the  regions  V(f\Hl  and  F?n///  can  contain  at  most  n2/S  leaves, 
it  is  clear  that  V pH t  contains  at  least  n2/2  leaves.  Consider  die  nv2  subgraphs 
of  M2  n  produced  by  eliminating  the  top  ( 3logn)/4  levels  of  the  row  and  column 
binary  trees  of  M2  n  .  Each  of  these  subgraphs  is  isomorphic  to  M2  ni/< .  By  the 
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Figure  8:  Partitioning  of  the  layout  for  M2n. 


pigeonhole  principle,  at  least  1/2  of  these  subgraphs  have  at  least  one  leaf  in 
VtC\ Hj.  If  d  <  ( a,/2nlogn)/S  (otherwise  we  arc  done),  then  at  most 
4d  <  ( a,/2nlogn)/2  edges  can  cross  the  boundary  of  V  f\ll  t  .  Tlius  at  most 
(a,/2nlogn)/2  of  the  subgraphs  which  have  at  least  one  leaf  in  VtC\ Hl  can  also 
have  a  node  or  part  of  an  edge  outside  F/fi//,  .  This  means  that  at  least 
(n3/2  -  a,/2nlogn)/2  copies  of  M2nj/4  are  wholly  contained  in  VpHt .  Applying 
the  inductive,  hypothesis,  we  conclude  that  F7n/fy  contains  at  least 

(n3/2  -  a  I/2nlogn)  W(n,/4) /  2  >  (an2 log2 n  -  a3/2n3/2log3n )  /  32 

>  ( an2log2n)/64  wire. 

The  last  inequality  follows  trivially  from  (*).  Thus  VpHj  has  at  least 
(an2log2n)/64  area  and  d  >  (a,/2nlogri)/8  ,  as  claimed. 

We  next  use  the  (Thompson  model)  layout  for  M2  n  to  construct  a  drawing  for 
the  complete  graph  on  n2  nodes  (namely,  the  n2  leaves  of  M 2  n ).  No  matter  how  the 
edges  of  the  complete  graph  are  drawn  in  the  plane  (e.g.,  they  may  cross  or  overlap), 
it  is  clear  from  Figure  8  that  the  sum  of  the  lengths  of  all  the  edges  (as  measured  in 
Euclidean  space)  is  at  least  n4d/64  >  ( aI/2n5logn)/2(> .  This  is  due  to  the  fact  that 
n4/64  edges  pass  from  region  V0  to  region  V2  and  that  these  regions  are  separated 
by  a  distance  d. 

Let  L{  denote  the  sum  of  the  lengths  of  the  edges  in  the  ith  levels  of  the  binary 
trees  of  M 2n  .  Since  every  level  /  edge  is  traced  over  at  most  n3Z'  times  in  the 
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drawing  of  the  complete  graph,  we  can  conclude  that 

>  ( al/2n5logn)/2 9 

i-l 

and  thus  that 

^£,2"'  >  (a,/2n2logn)/29 . 

In  particular,  this  means  that 

Lj  >  (ai/2n2logn2()/(29fii2) 

oO 

for  some  i  <  logn  .  (Recall  that  fi  =  2  j  2  •)  Otherwise, 

Li  <  (aI/2n2logn2ty(29fii2) 


for  1  <  i  <  logn  and  thus 

2V'  <  t^(o‘f/2n2logny(29pi2) 

i-l 

^  ( al/2n2logn)/2 9,  a  contradiction. 

Using  the  straightforward  relation 

W[n)  > 

where  /  has  been  chosen  so  that 

Lj  >  (al/2n2logn2ty(29fii2) , 

we  can  conclude  that 

£  22ia(nl)2(logn  -  i)2  +  (<xl/2n2logn2iy(29pi2) 

£  an2log2n  -  2ain2logn  +  (a1/2n2logn2hy(29(i i2) 

^  an2log2n  . 

The  last  inequality  follows  trivially  from  (**).  Thus  >  Qi^log2/ j)  for  all  n  □ 

Theorem  6:  The  wire  area  of  the  N-node  free  of  meshes  is  at  least  tt(NlogN). 

Proof:  As  we  showed  in  section  2c  of  this  paper,  the  jV-node  2-dimensional  mesh 
of  trees  can  be  embedded  in  an  0(NlogN)-no6e  tree  of  meshes.  From  Theorem  5, 
we  can  thus  conclude  that  the  wire  area  of  the  (NlogN)-no6e  tree  of  meshes  is  at 
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least  Q(Nlog2N).  Equivalently,  the  wire  area  of  the  N-node  tree  of  meshes  is  at  least 
U(NlogN)  □ 

Theorem  7:  Any  layout  of  the  N-node  augmented  tree  of  meshes  contains  a  wire  of 
length  at  least  Q(NI/2/logl/2N). 

Proof:  In  the  proof  of  l  heorem  5,  we  showed  that  any  layout  of  M,n  must  have 
two  leaves  which  are  spaced  at  least  il(nlogn)  distance  apart.  Since  Xf2n  can  be 
embedded  in  T 2n  so  that  the  leaves  of  M  2  n  are  embedded  in  or  near  the  leaves  of 
T2„  (see  the  embedding  in  section  2c),  we  can  observe  that  any  layout  of  T2n  must 
also  have  two  leaves  which  are  spaced  at  least  iUnlogn)  distance  apart.  Since  any 
pair  of  leaves  in  T 2n  are  linked  by  a  path  of  length  at  most  0 (logn)  in  T2n  \  we  can 
conclude  that  some  edge  of  hn  must  have  length  at  least  Q(n)  =  tt(Nl/2/logl/2N)  □ 

It  is  worthwhile  to  point  out  that  we  could  have  proved  both  Theorems  6  and  7 
directly,  using  arguments  similar  to  the  ones  used  to  prove  Theorem  5. 

Theorem  8:  Any  layout  of  the  N-node  2-dimensional  mesh  of  trees  contains  a  wire 
of  length  at  least  tt(N,/2logN/loglogN). 

Proof:  It  is  sufficient  to  show  dial  any  layout  for  M2n  contains  a  wire  of  length 
at  least  Sl{nlogn/loglogn).  Assume  for  the  purposes  of  contradiction  that  this  is  not 
the  case  and  consider  a  layout  of  Xl2n  for  which  the  longest  wire  has  length 
q  <  o(nlogn/'loglogn) .  We  first  show  that  (without  loss  of  generality)  the  area  of 
such  a  layout  is  at  most  0 (q2log2n)  <  o(n2log4n )  . 

Since  every  pair  of  nodes  of  M2n  is  linked  by  a  path  of  length  at  most  4logn ,  all 
of  the  nodes  in  the  layout  are  contained  in  a  4qlogn  x  4qlogn  square.  At  most 
l6qlogn  wires  may  leave  and  re-enter  the  square  at  various  points  along  its  perimeter. 
Without  increasing  the  lengths  of  any  of  these  wires,  it  is  possible  to  rewire  the 
segments  outside  the  square  using  at  most  0{q2lo^n)  additional  area.  Ill  us,  the 
resulting  layout  for  M2n  will  have  maximum  edge  length  q  and  area  at  most 
0(q2log2n). 

The  proof  is  completed  by  observing  that  any  layout  of  A/ 2  n  with  area  o (n2log4n) 
must  have  a  wire  of  length  at  least  S }{nlogn/loglogri).  From  the  proof  of  Theorem  5, 
we  know  that 

>  (a,/2n2lognY29 . 
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Thus  either 

1)  there  is  an  /  <  4loglogn  such  that 

Lj  >  (a  l/2n2Iogn2i)  /  (2,2loglogn) ,  or 

2)  there  is  an  i  >  4loglogn  such  that 

Lf  >  (al/2n2logn2)/(2I0fii*) 
where,  as  before,  the  constant  =  2  7  *2-  Otherwise, 

j** 

2^'  =  111?1  + 

<  ( a!/2n2logn)/2 10  +  [(a  ,/2n2logn)/2,0(i] 

^  ( a,/2n2logn)/2 9  ,  a  contradiction. 

The  second  condition  cannot  possibly  be  true,  however.  If  it  were,  the  area  of  the 
layout  would  be  at  least  I,  >  Sl(n2logn2i/i2)  which,  for  /  >  4loglogn  ,  means  that 

A  >  Sl(n2loginA.loglogri)2) 

>  Q(n2log*n) ,  a  contradiction. 

Thus  the  y?rs/  condition  must  be  true  and  there  is  an  /  such  that 
Lj  >  Q(n2logn2'/loglogn )  .  Since  there  are  n2,+ 1  type  i  edges  in  M2n  ,  we  can 
conclude  that  at  least  one  of  them  has  length  at  least  Q(nlogn/log!ogn)  □ 


5.  Remarks 

(a)  In  addition  to  being  good  lower  bounds  for  layout  area,  the  crossing  number 
and  wire  area  of  a  network  are  interesting  in  their  own  right  In  particular,  both  are 
worth  minimizing  when  designing  a  chip.  For  instance,  a  chip  with  a  large  number 
of  crossings  may  have  problems  with  capacitive  coupling  (i.e.,  interference  between 
overlapping  wires);  particularly  if  some  of  the  wires  cross  an  unusually  large  number 
of  the  other  wires  (as  was  the  case  with  the  r-dimensional  mesh  of  trees).  The  wire 
area  is  worth  minimizing  in  order  to  maximize  the  chip  yield.  As  many  chips  are 
mined  by  localized  random  errors,  chips  with  lower  wire  density  will  be  less  likely  to 
be  affected  by  such  problems. 
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(b)  Unfortunately,  our  results  indicate  that  both  the  crossing  number  and  the 
wire  area  are  usually  as  large  (up  to  a  constant)  as  the  area  of  the  layout.  In  fact,  all 
of  the  previously  known  nontrivial  lower  bounds  for  layout  area  are  also  lower 
bounds  for  crossing  number  and  wire  area.  This  is  due  to  the  fact  that  the  previously 
known  lower  bounds  for  layout  area  were  proved  as  a  consequence  of  the  identity 
A  >  ^(B2).  Since  we  showed  in  Theorem  1  that  C+N  >  QiB2),  the  same  bounds 
also  hold  for  crossing  number  and  wire  area.  For  example,  this  means  that  any 
layout  of  a  network  which  computes  an  /V-point  Fourier  transform  in  T  steps  must 
have  Ui^/T2)  wire  crossings.  Hence,  we  could  thus  conclude  that  the  /V-node 
shuffle  exchange  graph  and  the  /V-node  cube-connected-cycles  graph  have  crossing 
number  Sl(N2/log2N). 

(c)  The  previous  analysis  can  often  be  carried  one  step  further  in  order  to  show 
that  some  wires  in  a  layout  must  cross  many  other  wires.  For  example,  any  network 
which  computes  an  /V-point  Fourier  transform  in  T  steps  must  have  a  wire  which 
crosses  Q(N/7'2)  other  wires.  This  is  because  the  network  contains  0(7V)  wires  but 
has  at  least  Sl(N2/T2)  crossings.  In  particular,  this  means  that  any  layout  of  the  N- 
node  shuffle-exchange  graph  or  the  /V-node  cube-connected  cycles  graph  contains  a 
wire  which  crosses  Q(N/log? /V)  other  wires. 

(d)  The  techniques  developed  in  this  paper  can  also  be  used  to  reprove  other 
results  in  the  literature.  For  example,  Brent  and  Rung  showed  in  [BK80]  that  any 
layout  of  the  complete  /V-node  binary  tree  in  which  the  leaves  are  contained  on  the 
boundary  of  some  convex  region  requires  at  least  tt{NlogN)  wire  area.  Subsequently 
Patterson,  Ruzzo  and  Snyder  [PRS81]  showed  that  any  such  layout  with  area  A  must 
have  some  wire  of  length  tt(N/log(A/N)).  As  we  show  in  [L81a],  both  of  these 
results  can  be  simply  proved  using  the  techniques  used  to  prove  Theorems  5  and  8. 

(e)  The  methods  which  we  have  used  to  prove  crossing  number  and  wire  area 
lower  bounds  can  also  be  used  to  prove  bisection  width  lower  bounds.  For  example, 
these  techniques  can  be  easily  used  to  show  that  the  bisection  width  of  the  /V-node 
shuffle-exchange  graph  is  at  least  R (N/logN)  [L81a].  In  this  case  the  construction  of 
the  complete  graph  from  the  drawing  of  the  shuffle-exchange  graph  bears  a  strong 
resemblance  to  Thompson's  [T80]  information  flow  arguments.  In  fact,  it  appears 
that  many  of  Thompson’s  lower  bounds  can  be  reproved  in  this  fashion. 
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(0  When  defining  the  2-dimensional  mesh  of  trees  in  section  2a,  we  required 
that  the  binary  trees  be  constructed  so  that  M 2„  contain  22k  disjoint  copies  of  Minlk 
as  subgraphs  for  any  k.  It  is  interesting  to  note  that  networks  which  do  not  satisfy 
this  constraint  but  which  are  similar  to  the  2-dimensional  mesh  of  trees  in  all  other 
respects  have  essentially  the  same  computational  power  as  the  more  restricted  mesh 
of  trees.  Theorems  3,  5  and  8  do  not  necessarily  apply  to  such  networks,  however. 
In  fact,  we  do  not  know  if  the  same  wire  area  lower  bound  need  still  apply.  Using  a 
somewhat  different  method,  however,  we  have  shown  in  [L81a]  that  any  such 
network  must  still  have  a  large  crossing  number.  As  a  key  step  in  the  proof,  we 
generalize  Lemma  1  to  show  that  any  /V-node  graph  with  E  edges  has  crossing 
number  at  least  Q(E3/N '*)  whenever  E  >  4N  . 

(g)  The  area  required  to  lay  out  the  r-dimensional  mesh  of  trees  is  very  close  to 
that  required  to  lay  out  the  standard  mesh  of  the  same  size.  For  example,  for  r>2  the 
amount  of  area  required  for  the  mesh  of  trees,  000.  is  at  most  a  constant  times  as 
large  as  the  area  required  for  the  standard  rdimensional  mesh.  Thus  the 
computational  power  of  the  standard  mesh  can  be  greatly  enhanced  at  little  or  no 
cost  in  layout  area  by  adding  the  edges  of  the  mesh  of  trees. 

<b)  The  ^dimensional  mesh  of  trees  was  defined  as  a  natural  generalization  of 
the  computationally  powerful  2-dimensional  mesh  of  trees.  Mnn  can  also  be  viewed 
as  a  generalization  of  the  rcube,  also  a  very  powerful  communications  network. 
(For  example,  Mt2  is  an  r-cube  with  every  edge  replaced  by  a  path  of  length  2.) 
Viewed  in  this  light,  the  ^dimensional  mesh  of  trees  motivates  the  definition  of  a 
shuffle-iree  graph  in  the  same  way  that  the  rcube  motivates  the  definition  of  the 
shuffle-exchange  graph.  We  would  be  interested  to  know  if  there  any  practical 
applications  of  such  a  general  comunications  network. 

(i)  Using  standard  techniques,  it  is  not  difficult  to  show  that  all  of  the  asymptotic 
lower  bounds  proved  in  Theorems  3-8  are  tight  (although  it  is  likely  that  the  leading 
constants  can  be  substantially  improved).  In  addition,  the  drawings  and  layouts 
which  achieve  the  lower  bounds  for  crossing  number  and  wire  area  also  achieve  the 
lower  bounds  for  maximal  edge  crossing  and  edge  length.  Thus  there  are  no 
area/edge  length  tradeoffs  for  these  networks. 

(j)  After  writing  the  initial  version  of  this  paper,  we  became  aware  of  several 
other  papers  which  describe  parallel  computation  algorithms  using  the  mesh  of  trees. 
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For  example: 

1)  Nath,  Maheshwari  and  Bhatt  [NMB81]  have  used  the  network  (which  they  call 
the  orthogonal  trees  network)  for  sorting,  discrete  Fourier  transform,  minimum 
spanning  tree,  and  connected  components  (as  well  as  many  other)  problems, 

2)  Cappello  and  Steiglitz  [CS81]  have  used  the  network  (which  they  call  the 
orthogonal  forests)  for  integer  multiplication,  and 

3)  Gannon  [G81]  has  used  the  network  to  find  approximate  solutions  to  systems 
of  partial  differential  equations. 

(k)  Subsequent  to  the  final  writing  of  this  paper,  a  great  deal  more  was 
discovered  about  upper  and  lower  bounds  for  layout  area  and  crossing  number.  As  a 
forward  pointer,  we  refer  the  interested  reader  to  [L82].  Interestingly,  no  further 
progress  has  been  made  on  the  planar  layout  problem. 
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